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Abstract 

If  a  linear  program  (LP)  possesses  a  large  generalized  network  (GN)  subma¬ 
trix,  this  structure  can  be  exploited  to  decrease  solution  time.  The  problems  of 
finding  maximum  sets  of  GN  constraints  and  finding  maximum  embedded  GN 
submatrices  are  shown  to  be  NP-complete  indicating  that  reliable,  efficient  solu¬ 
tion  of  these  problems  is  difficult.  Therefore,  efficient  heuristic  algorithms  are 
developed  for  identifying  such  structure  and  are  tested  on  a  selection  of 
twenty-three  real-world  problems.  The  best  of  four  algorithms  for  identifying  GN 
constraint  sets  finds  a  set  which  is  maximum  in  twelve  cases  and  averages  99.1% 
of  maximum.  On  average,  the  CN  constraints  identified  comprise?'  more  than 
62.3%  of  the  total  constraints  in  these  problems.  The  algorithm  for  identifying 
embedded  GN  submatrices  finds  submatrices  whose  sizes,  rows  plus  columns, 
average  96.8%  of  an  LP  upper  bound.  Over  91.3%  of  the  total  constraint  matrix 
was  identified  as  a  GN  submatrix  in  these  problems,  on  average. 

"The  act  of  being  wise  is  the  act  of  knowing  what  to  overlook." 

William  James  (ca.  1890) 

1.  Introduction 

Large-scale  linear  programming  (LP)  models  frequently  have  sparse 
coefficient  matrices  with  special  structure.  If  special  structure  can  be 
identified,  it  can  often  be  exploited  to  reduce  the  cost  cf  solving  the  LP.  "Direct 
factorization,"  e.g.  [13],  maintains  a  partitioning  of  the  rows  and/or  columns  of 
all  simplex  bases.  Computations  are  reduced  with  respect  to  standard  methods 
if  special  structure  can  be  isolated  within  the  partitions.  "Decomposition,"  e.g. 
[14],  splits  a  problem  into  a  master  problem  and  one  or  more  subproblems.  This 
technique  is  most  efficient  when  subproblems  consist  entirely  of  special  struc¬ 
ture  allowing  their  rapid  solution.  The  details  of  these  exploitation  schemes  will 
not  be  discussed  here. 


Useful  structures  found  embedded  in  a  subset  of  the  rows  and/or  columns 
of  an  l.P  constraint  matrix  include  simple  upper  bounds  (at  most  one  nonzero 
element  in  each  row),  generalized  upper  bounds  (GUU)  (at  most  one  nonzero 
coefficient  in  each  column),  and  networks  (at  most  two  nonzero  elements  in 
each  column).  Varieties  of  embedded  networks  include  the  general  case,  gen¬ 
eralized  networks  (GN),  generalized  transshipment  networks  (GT)  (at  most  one 
coefficient  not  equal  to  +1),  and  pure  networks  (NET)  (at  most  one  +1  and  one  -1 
in  each  column). 

Simple  upper  bounds,  GUD  and  NET  structures  have  been  exploited  in  vari¬ 
ous  commercial  and  experimental  optimization  systems,  and  efficient  automatic 
identification  schemes  have  been  developed  to  find  (hose  structures,  c.g., 
[4.7, B]. 

Recent  research  has  produced  very  efficient  specialized  simplex  algorithms 
for  solving  network  problems  (For  example,  sec  [3]  for  NET,  j C>]  for  GN,  and 
[6.11]  for  GT.).  This  research  has.  in  turn,  been  exploited  to  develop  faetori/ed 
optimization  systems  which  solve  general  l.P  problems  with  a  set  of  rows  exhibit¬ 
ing  NET  structure  [12],  GN  structure  [  1 0],  and  GT  structure  [12,10],  Even  more 
recently,  optimization  systems  have  been  tested  which  use  direct  factorization 
[19]  or  primal  and/or  dual  decomposition  [  1-1  ]  to  exploit  embedded  GN  struc¬ 
ture. 

Nour  that  software  is  available  to  solve  GN  (and  GT)  problems  [6],  it  is  very 
likely  that  several  resc'arch  groups  will  exploit  GN  in  various  wavs  in  the  near 
future.  To  support  this  research,  we  are  interested  in  efficiently  and  automati¬ 
cally  identifying  GN  structure  of  the  following  varieties  in  general  l.P  coefficient 
matrices: 

GN’e  A  subset  of  l.P  columns  winch  are  GN,  or 

GNr  A  subset  of  LP  rows  which  arc  GN,  or 

GNn.c  An  embedded  GN  within  a  subset  of  the  rows  and  columns  eT  1,P. 

Because  the  efficiency  of  solving  a  general  LP  with  CN-cxploiling  methods  is 
enhanced  if  the  GN  structure  is  l.irgc,  marLmum  GN  structures  are  our  goal. 
This  leads  to  the  maximization  problems  described  below. 

Let  A=}a,;}  be  the  mxn  coefficient  matrix  of  LP,  and  let  H=[Ay]  be  the 
associated  0-1  incidence  matrix  for  A  The  three  maximization  problems,  formu¬ 
lated  as  integer  programs,  are 

iV(GNc):  max  V  c 

c  7 

s  t.  £  Ay  cj  <  2  for  all  j 
i 

c,  c  jo.  i;. 

where  Cj  is  a  binary  decision  variable  indicating  inclusion  of  column  j  in  GNc; 

/./(GNr):  max^n 

"  » 

s  t  V  AyT\  ^  2  for  all  j 

i 

r,  e  [O.IJ. 

where  rt  is  a  binary  decision  variable  indicating  inclusion  of  row  i  in  GNR;  and 
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^(GNrc):  max  2  ri  +  £  c, 

k.c  t  i 

s.t.  2  h*jri  +  rnici  s  2  +rr^  for  all  j 

i 

r<,  Cj  e  jO.lj. 

where  r*  and  c;  are  binary  decision  variables  indicating  respective  inclusion  of 
row  i  and  column  j  in  GNrc.  and  where  m;  s  2  _  2-  Note  that  our  definitions 

i 

of  maximum  GN  factorizations  are  expressed  simply  as  the  sum  of  the  rows 
and/or  columns  included. 

Much  work  has  been  done  on  the  development  of  algorithms  to  identify  spe¬ 
cial  substructures  in  LPs.  Previous  work  in  identifying  GUB  subsets  of  con¬ 
straints  is  well  known  [4.7].  Brown  and  Wright  [B]  have  explored  ways  to  identify 
NET  subsets.  Extraction  of  hidden  NET  structure  with  general  linear  transforma¬ 
tions  has  been  discussed  by  Bixby  and  Cunningham  [2]  and  by  Musalem  [20], 
Identification  of  GN  row  sets  and  other  structures  has  been  proposed  by  Schrage 
[21]. 

The  problems  of  identifying  maximum  GUB  and  NET  constraint  subsets  are 
NP-complele,  and  consequently,  exact  solutions  cannot  be  guaranteed  to  be 
obtained  quickly.  Since  GUB  and  NET  constraints  are  special  cases  of  GN  con¬ 
straints,  it  is  to  be  expected  that  exact  solutions  of  the  GN  identification  prob¬ 
lems  will  also  be  difficult  to  obtain.  We  show  that  the  GN  identification  problems 
are,  in  fact,  NP-complete,  but  also  give  effective  and  reliable  heuristic  algo¬ 
rithms  for  them. 

In  section  2,  the  complexities  of  the  three  maximization  problems  are 
investigated.  A/(GNr)  and  Af  (GNrc)  are  shown  to  be  difficult  so,  in  section  3, 
efficient  algorithms  are  developed  for  finding  approximate  solutions  to  these 
problems.  Four  specialized  integer  programming  heuristics  are  described  for 
identifying  maximal  GNr  sets.  Two  of  the  algorithms  are  "addition''  heuristics 
which  begin  with  the  empty  GNr  set  and  successively  add  rows  while  maintaining 
feasibility.  The  other  two  algorithms  are  "deletion”  heuristics  which  begin  with 
an  infeasible  GNr  set  and  successively  delete  rows  until  a  feasible  set  is  found. 
Algorithm  GNRC  for  A/ (GNrc)  takes  as  input  the  GNr  set  found  by  any  one  or  the 
the  GNr  heuristics.  Then,  it  successively  adds  rows  which  introduce  the  least 
amount  of  weighted  infeasibility  and  drops  those  columns  where  an  infeasibility 
results.  In  this  way,  a  sequence  of  GNrc  sets  is  produced  and  the  maximum  of 
these  taken  to  be  the  heuristic  solution  to  A/(GNr,c).  After  the  algorithms  are 
presented,  computational  experience  is  given  in  section  4. 

2.  Complexity 

In  this  section  we  investigate  the  complexity  of  A/(GNc),  A/(GNr),  and 
Af(GNR,c).  Af(GNc)  is  trivially  solvable  in  polynomial  time  by  choosing  all  columns 
with  at  most  two  nonzero  elements  in  them;  consequently,  its  complexity  will  not 
be  discussed  further.  The  other  two  problems  are  more  interesting. 

Following  standard  practice,  A/(GNr)  and  A/(GNrc)  will  be  studied  with 
respect  to  their  associated  decision  problems: 

^(GNr):  Does  there  exist  a  set  of  rows  R  in  II  such  that,  for  positive  integer 

k<m, 

\R  \  ^  k  and  2  Ni  -  2  for  all  j  ? 

icJt 


D{  GNRC):  Does  there  exist  a  set  of  rows  R  and  columns  C  in  H  such  that,  for 
positive  integer  k  <m+n, 

|J?|  +  |C|fc£  and  V  ^  *  2  forall>eC? 

Of  course,  a  polynomial  algorithm  for  one  of  the  above  decision  problems  would 
imply  a  polynomial  algorithm  for  the  associated  maximization  problem  using, 
say,  a  binary  search  on  the  values  of  A:. 

We  consider  the  complexity  of  2?(GNr,c)  first.  Yannakakis  [24]  investigated 
the  problem  of  finding  the  least  number  of  nodes  which  can  be  deleted  from  a 
bipartite  graph  such  that  the  resulting  induced  subgraph  has  a  particular  pro¬ 
perty.  Restated  in  terms  of  the  decision  problem,  he  gives  the  following 
theorem  on  0-1  matrices  as  a  corollary  of  his  results  on  graphs. 

Theorem  1:  Let  Q  be  any  class  of  0-1  matrices  which  is  closed  under  permuta¬ 
tion  and  deletion  of  rows  and  columns.  Let  H  be  an  mxn  0-1  matrix,  and  let  k 
be  some  positive  integer,  k  <  m+n.  Then,  finding  an  m0xn0  submatrix  Hq  of  II 
such  that  Hq  e  Q  and  m,0+n0  ^  A:  is  polynomial  if  the  matrices  of  Q  have  bounded 
rank  and  is  NP-complete  otherwise. 

It  is  assumed  above  that  membership  in  Q  can  be  determined  in  polynomial  time 
for  a  matrix  of  bounded  size  (otherwise,  NP-hardness  would  be  implied). 

This  theorem  is  impressive  in  that  it  handles  the  NP-completeness  question 
for  0-1  matrices  in  a  wholesale  fashion.  The  NP-completeness  of  Z?(GNr,c)  follows 
as  a  simple  corollary. 

Corollary  1:  Z7(GNrc)  is  NP-complete. 

Proof:  Let  Q  be  the  class  of  0-1  matrices  with  at  most  two  Is  in  each  column.  Q 
is  obviously  closed  under  permutation  and  deletion  of  rows  and  columns; 
matrices  of  arbitrarily  large  rank  can  be  found  in  Q;  and  membership  in  Q  can 
be  determined  in  polynomial  time.  Z?(GNrc)  for  the  incidence  matrix  II  is 
equivalent  to  searching  for  an  m0xn0  submatrix  Hq  of  II  such  that  IIq^Q  and 
m0+n0  ^  k.  Therefore,  by  Theorem  1,  Z7(GNRC)  is  NP-complete.  ■ 

A  0-1  matrix  H  is  represented  as  a  bipartite  graph  with  nodes  on  one  side  of 
the  bipartition  corresponding  to  rows,  nodes  on  the  other  side  of  the  bipartition 
corresponding  to  columns,  and  an  edge  ( i,j )  for  each  Ay  =1  .  D(GNr) 
corresponds  to  a  node-deletion  problem  with  deletions  restricted  to  one  side  of 
the  bipartition;  Yannakakis's  results  do  not  directly  apply  since  they  pertain  to 
node  deletions  on  either  side  of  the  bipartition*.  Therefore,  we  use  a  problem- 
specific  proof  to  show  that  Z?(GNr)  is  NP-complete. 

Lemma  1.  ZJ(GNr)  is  NP-complete. 

Proof:  For  ease  of  representation,  U(GNr)  will  be  equivalently  stated  in  matrix 
notation: 

D( GNr):  Does  there  exist  a  binary  m -vector  X  such  that  lx>  A:  and  Hrx^  2? 

D( GNr)  is  obviously  in  NP.  We  show  that  it  is  NP-complete  by  a  transformation 
from  the  "Exact  Cover  by  3-Sets"  problem  [15],  as  specialized  by  Garey  and 
Johnson  [  10]. 

D(X 3C):  Does  there  exist  a  binary  p-vector  y  such  that  ly  =  g  and  Ny  =  1 

where  N  is  a  3gxp,  0-1  matrix  with  exactly  three  Is  in  each  column 
and  at  most  three  Is  in  each  row? 

1  Bartholdi  [1]  has  addressed  this  topic,  but  his  results  are  incorr.plete.  For  instance, 
without  additional  restrictions,  his  Theorem  2  would  imply  that  0(GNc)  is  NP-complete. 


For  each  row  i  in  N  with  only  one  1  or  two  Is,  augment  N  with  one  or  two  unit 
vector  columns  e4,  respectively.  Since  none  of  these  columns  could  be  included 
in  an  exact  cover  of  size  q,  D(X 3C)  is  equivalent  to 

D(X 3C'):  Does  there  exist  a  binary  vector  y'  of  length  p+i  such  that  ly*  =  q 

and  (E,N)y'  =  1  where  E  corresponds  to  l  augmenting  columns? 

By  construction  of  D(X 3C’),  no  set  of  columns  of  cardinality  less  than  q  could 
ever  cover  all  the  rows  exactly  once  let  alone  more  than  once.  Thus,  D(X3C')  is 
equivalent  to  a  "minimum  cover  problem" 

Z?(MC):  Does  there  exist  a  binary  vector  y  such  that  ly '  ^  q  and  (RN)/  >  1? 

Let  x=l-y'.  Since  each  row  contains  exactly  three  Is,  D( MC)  is  equivalent  to  a 
"maximum  uncover  problem" 

D( MUC):  Does  there  exist  a  binary  vector  x  such  that  lx^p+f-g  and 
(E,N)x<2? 

Since  all  above  transformations  are  of  polynomial  complexity,  and  since  P(MUC) 
is  an  instance  of  P(GNr),  Z?(CNr)  is  NP-complete.  - 

3.  Algorithms 

The  complexity  results  of  the  preceding  section  indicate  that  solving 
A/(GNr)  and  iff  (GNr  c)  exactly  could  be  very  time-consuming.  Therefore,  heuris¬ 
tic  algorithms  have  been  developed  for  obtaining  approximate  solutions.  We 
describe  the  algorithms  for  A/(GNr)  first. 

M(GNr)  is  an  integer  programming  problem  of  the  form  max  c x  s  t.  Ax<b, 
x  binary,  where  all  data  is  nonnegative.  Thus,  integer  programming  heuristics 
seem  appropriate  for  attacking  this  problem.  Two  basic  heuristic  techniques 
exist  for  solving  such  integer  programs  which  we  label  "addition”  heuristics  and 
"deletion"  heuristics.  An  addition  heuristic  begins  with  the  feasible  solution  x=0 
and  successively  sets  to  1  that  variable  x;-  which  which  myopically  maximizes 
effective  profit.  The  effective  profit  associated  with  xi  is  where  <pj  is  a 

penalty  whose  definition  varies  between  heuristics,  but  which  in  some  way 
reflects  the  units  of  feasibility  used  up  by  setting  Xj  to  1.  The  addition  heuristic 
stop:  when  when  no  additional  variables  can  be  set  to  1  without  violating  feasibil¬ 
ity.  A  deletion  heuristic  begins  with  the  usually  infeasible  solution  jr=l  and  suc¬ 
cessively  sets  to  0  that  variable  Xj  which  myopically  minimizes  loss  of  effective 
profit  Cj/ipj.  Here,  tfij  is  a  penalty  which  reflects  the  amount  of  infeasibility 
currently  being  contributed  by  Xj  =  l.  The  deletion  heuristic  stops  when  when  a 
feasible  solution  is  obtained. 

We  have  specialized  two  addition  heuristics  and  two  deletion  heuristics  to 
M( GNr).  The  addition  heuristics  begin  with  an  empty  CNr  set  and  successively 
add  rows  to  the  set  until  a  maximal  set  is  obtained.  The  deletion  heuristics 
begin  with  an  infeasible  GNr  set  consisting  of  all  the  rows,  and  rows  are  succes¬ 
sively  deleted  until  a  feasible  set  is  obtained.  Since  a  GNr  set  obtained  by  dele¬ 
tion  may  not  be  maximal,  a  second  phase,  an  addition  phase,  is  appended  to 
insure  that  the  set  is  maximal.  To  further  expand  the  GNr  set  found,  it  is  possi¬ 
ble  to  devise  post-maximal  techniques  similar  to  the  2-opt,  3-opt  and  general  k- 
opt  procedures  used  in  traveling  salesman  heuristics,  e.g.,  [16,17],  Application 
of  such  techniques  was  unwarranted,  however,  since  computational  results  in 
section  4  show  that  excellent  approximate  solutions  were  obtained  using  the 
basic  addition  and  deletion  heuristics. 

The  addition  heuristics  are  described  by  Algorithm  CNRa,  with  variations 
"Greedy"  and  "Toyoda.”  The  effective  profit  associated  with  adding  row  i  to  the 
GNr  set  is  1  /  RPi  where  /?Pt  is  a  row  penalty  derived  from  the  current  nonmaxi- 
mal  solution,  the  nonzero  elements  in  the  row  and  feasibility  requirements. 
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Thus,  at  each  step  of  the  algorithm,  the  row  with  the  smallest  penalty  is  added 
to  the  GNr  set.  Feasibility  is  maintained  by  setting  to  infinity  the  row  penalty  of 
any  row  whose  addition  would  cause  an  infeasibility.  In  the  Greedy  variation, 
RPX  equals  the  number  of  nonzero  elements  in  the  row  if  the  penalty  is  finite. 
The  Toyoda  variation  is  a  modification  of  an  integer  programming  heuristic 
developed  by  Toyoda  [23].  In  this  heuristic,  the  finite  row  penalty  RPi  is  based 
not  only  on  the  number  of  nonzero  elements  in  the  row,  but  also  on  how  close  to 
feasibility  limits  addition  of  the  row  would  bring  the  current  solution. 

The  deletion  heuristics  are  described  by  Algorithm  GNRd,  with  variations 
"Dobson"  and  "Senju  &  Toyoda.'  In  this  algorithm,  each  row  has  a  penalty  RPt 
which,  roughly  speaking,  indicates  how  much  infeasibility  the  row  is  contribut¬ 
ing.  1/  RPl  is  the  loss  in  effective  profit  if  row  i  is  removed  from  the  GN#  set. 
Thus,  this  algorithm  successively  deletes  rows  with  maximum  penalty  to  minim¬ 
ize  the  loss  of  effective  profit. 

Dobson  [9]  analyzes  and  gives  worst-case  performance  guarantees  for  an 
addition  heuristic  for  integer  programs  of  the  form  min  c*.  s.t.  Ax^-b,  0<x<u,  x 
integer,  where  all  data  is  nonnegative.  By  simple  substitution  of  variables,  how¬ 
ever,  the  Dobson  heuristic  may  be  interpreted  as  a  deletion  heuristic  for  prob¬ 
lems  in  the  form  of  A/(GNR).  At  each  deletion  step  of  this  heuristic,  RPX  is  the 
number  of  nonzero  elements  in  row  i  which  are  contributing  to  an  infeasibility. 
If  m0  is  the  optimal  solution  to  A/(GNr)  and  is  the  heuristic  solution  obtained 
by  deletion  only,  Dobson's  worst-case  bound  on  performance  is 

(m  -  mD)/  (m  —  m0)  ^  1  /k  where  d  is  the  maximum  number  of  nonzero 

k  =  1 

elements  in  any  row.  This  is  the  only  performance  guarantee  known  for  any  of 
the  heuristics  implemented  in  this  paper.  Unfortunately,  the  upper  bound  on 
m0  this  yields  is  rather  weak  in  practice  (See  Table  3.).  Any  addition  heuristic 
may  be  used  as  a  second  phase  for  a  deletion  heuristic,  but  for  the  Dobson  dele¬ 
tion  heuristic,  we  chose  the  greedy  addition  heuristic  as  the  second  phase  since 
the  definition  of  RPX  is  consistent  between  the  two  phases. 

The  second  variant  of  GNRd  is  a  specialization  of  the  heuristic  devised  by 
Senju  and  Toyoda  [22]  which  those  authors  label  an  "effective  gradient  method." 
For  Af  (GNr),  Hr  maps  the  set  of  feasible  r  values  into  the  n-dimensional  hyper¬ 
cube  whose  sides  are  of  length  2.  At  every  step  of  the  algorithm,  given  current 
infeasible  solution  r,  RPx={Y\t t-2)*  hl,  where  the  jth  element  of  (Hrr-2)+  is 

n 

max  [0,  ^]/ilJ7-i-2j.  RPX  may  be  interpreted  as  the  length  of  the  projection  of 

i  =  l 

the  vector  hl  onto  the  shortest  vector  extending  from  the  point  H7  r  outside  of 
the  hypercube  to  the  boundary  of  the  hypercube.  The  modified  Toyoda  addition 
heuristic  is  used  as  the  second  phase  of  this  heuristic. 

The  two  algorithms  GNRa  and  GNRd,  with  their  variations,  are  outlined  as 
follows: 
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Algorithm  GNRa 

Input:  The  LP  coefficient  matrix  A 

Output:  A  set  of  row  indices  7r  corresponding  to  the  largest  GNr  set  found  in 

A 

Comment:  The  basic  algorithm  is  the  "Greedy"  addition  heuristic.  The  modified 
"Toyoda”  heuristic  is  obtained  by  substituting  the  statement  in 
square  brackets  for  its  predecessor. 

Step  0.  '’Initialization" 

Initialize: 

(a)  /  =  $  and  /'  =  $  1,2 . m 

(b)  For  each  column  j,  a  column  bound 

CBj  =  2 

Comment:  CBj  is  the  number  of  elements  column  j  may  con¬ 
tain. 

(c)  For  each  ie/'.  a  row  penalty 

RP%  =  El 

Step  1.  "Row  Addition” 

Let  RP  -  RPS  be  the  smallest  row  penalty  (corresponding  to  row  se/’). 
If  RP  <  °°  then 

(a)  Move  s  from  I'  to  /. 

(b)  For  each  column  j  such  that  asj  *  0, 

(i)  Let  CBj  -  CBj  -  1. 

( ii )  If  CBj  =0  then  for  each  i*s  such  that  Oy  ^  0,  let 
RPi  =  ». 

[(ii)  For  each  i^s  such  that  a ij  ?  0,  if  CBj  =  1  then  let 
RPi  =  RPi  +  1,  else  let  RPi  =  °°. 

(c)  Repeat  Step  1. 

Step  2.  "Termination" 

Print  Ir  =  I  and  STOP. 

End  of  Algorithm  GNRa 


Algorithm  GNRd 


Input:  The  LP  coefficient  matrix  A. 

Output:  A  set  of  row  indices  / r  corresponding  to  the  largest  GNr  set  found  in 

A 

Comment:  The  basic  algorithm  is  the  "Dobson"  heuristic.  The  "Senju  and  Toy- 
oda”  heuristic  is  obtained  by  substituting  the  statements  in  square 
brackets  for  their  predecessors. 


Step  0.  "Initialization" 

Initialize: 

(a)  /  =  [1,2 . m]  and  /'  =  <f>. 

(b)  For  each  column  j ,  a  column  penalty 

CPi  =  (  2  1 )  -  2. 

“ty*0 

id 


Step  1. 


Step  2. 


Comment:  CPj  is  the  number  of  "excess”  elements  in  column 
3  ■ 

(c)  For  each  iel,  a  row  penalty 

m=  £  i. 

atj*° 

CPj>  a 


Comment:  RP,  is  number  of  units  of  infeasibility  which  row  i 
is  currently  contributing. 

(c)  For  each  ie/,  a  row  penalty 

PPi=  £  cpj. 

«y*° 
cp^>  o 

Comment:  RPi  is  the  sum  of  excess  elements  in  columns  with 
_  a  nonzero  entry  in  rowi. 

"Row  Deletion" 

Let  RP  =  RPi  be  the  largest  row  penalty  (corresponding  to  row  £e/). 

If  RP  >  0  then 

(a)  Move  l  from  I  to  I'. 

(b)  For  each  column  j  such  that  atj  #  0 

( i )  If  CPj  =  1  [If  CPj  >  0]  then,  for  each  i?s  such  that 
ciy  /  0,  let  RPi  =  RPi  -  1. 

(ii)  Let  CPj  =  CPj  -  1. 

(c)  Repeat  Step  1. 

"Row  Addition  Penalties" 

For  each  ie/',  compute  a  row  penalty 


RP,  = 

RPi  = 


2  1  if  CPj  <  0  for  all  *  0 
00  otherwise. 

2  ( CPj  +  3)  if  CPj  <  0  for  all  ay  *  0 

Oy*0 


eo 


otherwise. 


Step  3.  "Row  Addition" 

Let  BE  =  RP*  be  the  smallest  row  penalty  (corresponding  to  row  se/'). 

If  RP  <  then 

(a)  Move  s  from  /'  to  /. 

(b)  For  each  j  such  that  a,j  *  0,  let  CPj  -  CPj  +  1. 

(c)  Go  to  Step  2. 

Step  4.  "Termination" 

Print  /r  =  /  and  STOP. 

Hid  of  Algorithm  GNRd 

The  execution  times  of  the  above  algorithms  and  the  other  algorithms 
described  in  this  paper  are  quite  short  if  proper  data  structures  are  used.  The 
initial  computation  of  the  row  and  column  penalties  can  be  made  very  quickly  if 
the  nonzero  entries  in  each  row  and  column  are  stored  in  a  linked  list.  Column 
penalties  are  then  updated  in  a  single  pass  of  a  row.  Because  of  sparsity,  row 
penalties  can  usually  be  updated  in  passes  through  just  a  few  columns. 
Efficiency  is  further  improved  if  row  and  column  partitions  are  maintained  with 
an  indirect  address  array  which  allows  contiguous  access.  Associated  with  this 
mapping  array,  a  second  array  expresses  the  inverse  map  to  speed  updating. 

An  easily  computable  upper  bound  on  M(GNr),  denoted  UBr,  is  useful  for 
checking  the  efficacy  of  the  above  algorithms.  Algorithm  UBR  is  designed  for 
this  purpose.  Let  Aj  and  Ae  be  a  partition  of  the  rows  of  A  and  let  z ,  z ,  and  z2  be 
the  solutions  to  A/(GNr)  on  A  A!  and  Ag.  respectively.  If  UBi  is  any  valid  upper 
bound  on  M (GNr)  for  Ai,  then 

z  <,  z,  +  z2  <  UBi  +  2z 

Algorithm  UBR  iteratively  applies  the  above  statement,  computing  the  simple 
bound  UBi  and  letting  A=  Ag  after  each  iteration.  This  is  repeated  until  all 
columns  of  Aa  have  at  most  two  nonzero  elements  in  them  at  which  point  z2  is 
equal  to  the  number  of  rows  in  A*.  UBr  is  then  given  by  the  sum  of  the  UBX 
upper  bounds  found  at  each  iteration  plus  Z2  found  at  the  last  iteration.  At  each 
iteration.  A  is  partitioned  with  respect  to  that  column  j  having  the  maximum 
number  of  nonzero  entries.  Aj  is  all  rows  of  Awith  Oy  *  0  and  UB\  -  Zj  =  2  since 
column^'  has  only  nonzero  elements  in  At. 


Algorithm  UBR 

Input:  The  LP  coefficient  matrix  A 

Output:  A  value  UBr,  an  upper  bound  on  |  /r]  . 

Step  0.  "Initialization" 

Initialize: 

(a)  I  =  {1.2 . m  j,  and  UBr  =  0. 

(b)  For  each  column  j,  a  column  count 

cq  =(  £  i). 

icj 

Step  1.  “Iterative  Partitioning" 

Let  CC  =  CQ  be  the  largest  column  count  (corresponding  to  column  l). 
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If  CC  >  2  then 

(a)  Let  UBr  =  UBr  +  2. 

(b)  For  each  ie.1  such  that  ^  0, 

(i)  Delete  i  from /. 

(ii)  For  each;  such  that  a„  *  0,  update  column  count  letting 
CCj  =  CCj  -  1. 

(c)  Repeat  Step  1. 

Step  2.  "Termination" 

Print  UBr  =  UBr  +  |  / 1  and  STOP. 

End  of  Algorithm  UBR 

Algorithm  GNRC,  the  heuristic  for  M(G NRC).  is  outlined  next.  Any  one  of  the 
integer  programming  heuristics  described  for  A/(GNr)  could  be  applied  to  this 
problem.  However,  these  algorithms  will  normally  give  only  a  single  answer  to 
the  problem;  our  algorithm  allows  the  exploration  of  a  complete  trajectory  of 
maximal  GNrc  sets  beginning  with  GNr  and  ending  with  GN^.  Our  algorithm 
begins  with  the  set  of  rows  Ir  found  in  Algorithm  GNRa  or  GNRd  and  repeatedly 
attempts  to  expand  this  set  by  deleting  columns,  always  saving  the  largest  GNrc 
set  found.  This  approach  was  suggested  by  manual  analysis  of  several  problems 
for  which  the  GNR  set  is  limited  by  a  few  key  complicating  columns.  Deleting 
these  columns  produced  a  much  Larger  embedded  GNr.c  set.  and  motivated 
development  of  a  new  factorization  LP  code  which  effectively  exploits  GNrc 
structure  [  19]. 


Algorithm  GNRC 

Input  The  LP  coefficient  matrix  A  and  a  GNr  set  Ir,  |/r|  <  m,  e.g.,  //?  from 
Algorithms  GNRa  or  GNRd. 

Output:  A  set  of  row  indices  /r.c  and  a  set  of  column  indices  JriC  corresponding 
to  the  largest  GNr.c  structure  found  in  A 

Step  0.  "Initialization" 

Initialize: 

(a)  I  =  Ir.  /'  =  { 1,2 . mj  -  /,  J  =  \1,Z . n  j,  /r.c  =  /.  and 

Jr,c  -  J- 

Comment:  /  and  J  are  the  current  sets  of  row  and  column 
indices  while  /r.c  and  /r.c  store  the  best  sets  found. 

(b)  For  each  column  ;'e/,  a  column  penalty 

CPi  =  (  £  1)  “  2. 
oyrO 
1C I 

Comment:  These  column  penalties  remain  as  an  artifact  of 
Algorithm  GNRd  and  can  be  defined  as  input. 

(c)  For  each  ie/'.  a  row  cost 

RCX  =  2  1- 
0 

CPt=  0 

Comment:  /?Q  is  the  number  of  columns  which  must  be 
deleted  if  row  i  is  added  to  I. 


Step  1.  "Column  Deletion" 

Let  gC  =  RCS  be  the  smallest  row  cost  (corresponding  to  row  se/'). 

(a)  For  each  je.J  such  that  a,j  *  0, 

(i)  Let  CPj  =  CPj  +  1. 

(it)  If  CPj  =  1  then  delete  j  from  J  and  for  each  ic/’  such 
that  ay  t  0,  update  row  costs  letting  RQ  =  /?Q  -  1. 

(b)  Move  s  from  /'  to  /. 

Step  2.  "Row-inclusion  Penalties" 

For  each  iel‘,  compute  a  row  penalty 

£  ( cpj  +  1)  if  CPj  <  0  for  all  ay  *  0 

RPi*  %° 

00  otherwise 

Step  3.  "Rowr  Addition" 

Let  RP  -  RPS  be  the  smallest  row  penalty  (corresponding  to  row  seP). 
If  EE  ^0  then 

(a)  Move  s  from  /'  to  / . 

(b)  For  each  jieJ  such  that  aSJ  *  0 

( i )  Let  CP3  -  CP}  +  1. 

( ii )  If  CPj  =  0  then  for  each  ie/'  such  that  a y  *  0,  update 
row  costs  letting  /?Q  =  RCj  +  1. 

(c)  Go  to  Step  2. 

Step  4.  "Incumbent  Test" 

If  |  / 1  +  |  J- 1  >  |  /r.cI  +  I  */r.cI  then  let  /r.c  =  /  and  /R.c  =  J . 

Step  5.  "Termination" 

If  1 1 1  <  m,  then  go  to  Step  1.  Otherwise,  print  /R.c,  ^r.c  and  STOP. 

End  of  Algorithm  GNRC 

A  stronger  test,  allowing  preemptive  termination,  is  possible  at  Step  5:  If 
| / 1  <m  and  m  +  |«/|  >  l  /r.c I  +  |</r,cI-  However,  the  weaker  test  permits  the 
exploration  of  a  complete  trajectory  for  GNr  c  as  discussed  above. 

Along  the  lines  of  UBr,  an  easily  computed  upper  bound  on  Af(GNRc). 
denoted  UBr c.  was  developed  to  check  the  accuracy  of  GNRC.  Partition  A  as  fol¬ 
lows: 

A  -  ^11  |  A]  2 

[AiilAa 

Let  z.  z u  and  z22  be  the  solutions  to  M( GNR)  on  A  An  and  Ac2,  respectively,  and 
let  UB H  be  any  simple  upper  bound  on  Af(GNRc)  for  Au.  Then 

z  ^  zu  +  z22<  UBU  +  z22. 

Algorithm  UBRC  computes  UBr c  by  iteratively  applying  the  above  statement, 
computing  the  simple  bound  VBU  and  letting  A=  Ag2  after  each  iteration.  This 
is  repeated  until  all  columns  of  A^  have  at  most  two  nonzero  elements  in  them 
at  which  point  z2  is  equal  to  the  number  of  rows  plus  the  number  of  columns  in 
As2-  UBrc  is  then  given  by  the  sum  of  the  UB  j  upper  bounds  found  at  each 
iteration  plus  z22  found  at  the  last  iteration.  If  A  u  is  selected  such  that  it  con¬ 
sists  of  single  column  and  three  rows,  all  with  nonzero  elements,  then 
UB\\  =  z  11  =  3.  Computational  experience  has  indicated  an  effective  rule  for 


selecting  the  partition:  Among  all  columns  in  A  having  at  least  3  nonzeros,  select 
that  column  having  the  minimum  number  of  nonzeros,  and  within  that  column 
select  the  first  three  rows  with  nonzeros  in  them.  If  k  partitions  are  carried  out 
before  Azz  becomes  a  GN  matrix,  it  follows  that: 

UDRiC  =  3k  +  \Izz\  +  \Jzz\  =3fc+(|/|— 3*)+(m-*)  =  m  +  m-*- 

The  last  equality  is  used  in  computing  UBrc- 


Algorithm  UBRC 

Input:  The  LP  coefficient  matrix  A 

Output:  A  value  UBRC,  an  upper  bound  on  |/r,cI  +  I^R.cl- 

Step  0.  ’’Initialization'' 

Initialize: 

(a)  /  =  {1,2 . m  j,  and  UBrc  =  |/|  +  \J  I- 

(b)  For  each  column  j,  a  column  count 

cq  =(  £  i ). 

id 

Step  1.  "Iterative  Partitioning" 

Let  CC  =  CCS  be  the  smallest  column  count  greater  than  2 
(corresponding  to  columns). 

If  no  such  column  exists,  go  to  step  2.  Else, 

(a)  Let  UBrq  —  UBr.c  ~  !• 

(b)  For  exactly  three  iel  such  that  A  0, 

(i)  Delete  i  from  /. 

(ii)  For  each  j  such  that  a^j  *  0,  update  column  count  letting 
CC)  =  CCj  -  1. 

(c)  Repeat  Step  1. 

Step  2.  "Termination" 

Print  L'fi’u  c  and  STOP. 

End  of  Algorithm  UBR 


4.  Computational  Experience 

The  algorithms  described  in  section  3  have  been  implemented  in  FORTRAN, 
using  the  X-System  [5]  as  the  host  optimization  package.  Table  1  identifies 
twenty-three  LP  and  mixed  integer  programming  (MIP)  problems  which  have 
been  collected  from  various  sources  over  the  years.  Some  of  these  models  are 
very  well  known,  e.g.,  Dantzig's  PILOT  and  the  U.S.  Department  of  Energy's  PAD 
and  PIES,  and  most  of  them  were  sent  to  us  because  of  their  difficulty,  solution 
expense,  or  outright  solution  failure  on  commercial  optimization  systems.  Table 
1  shows  problem  dimensions  excluding  right-hand  sides  and  objective  functions. 
Computation  times  displayed  in  Tables  2-4  are  compute-seconds,  accurate  to 
the  precision  shown,  for  FORTRAN  IV  H  (Extended)  with  0ptimize(2),  run  on  IBM 
3033AP  under  VM/CMS. 


Table  1.  LP /HIP  Problem  Set 


Problem 

Constraints 

Variables 

Nonzero  Els. 

Model 

AIR 

170 

3,040 

6,023 

Physical  Distribution 

ALUMINUM 

4,045 

8,805 

27,917 

Econometric  Production  4  Distribution 

COAL 

170 

3,753 

7,506 

National  Energy  Planning 

CUBIC  1 

657 

3,074 

15,894 

Combinatorics  Problem 

CUBIC2 

2,689 

11,905 

63,361 

Bigger  Combinatorics  Problem 

CUPS 

360 

618 

1,341 

Production  Scheduling 

ELEC 

784 

2,800 

8,462 

Energy  Production  &  Consumption 

FERT 

605 

9,024 

40.4B4 

Production  &  Distribution 

FOAM 

999 

4,020 

13,083 

Production  Scheduling 

FOOD 

4,010 

14,409 

23,332 

Production,  Distribution  &  Inventory  Planning 

GAS 

788 

5,541 

31,020 

Production  Scheduling 

JCAP 

2,486 

3.&19 

9,510 

Production  4  Shipment  Scheduling 

LANG 

1,235 

1,425 

22,028 

Equipment  &  Manpower  Scheduling 

NETTING 

89 

190 

308 

International  Currency  Exchange 

ODSAS 

4,647 

4,995 

30,832 

Manpower  Planning 

PAD 

694 

3,297 

15,541 

Energy  Allocation,  Distribution  Sc  Consumption 

PAPER 

2,868 

5,318 

23,746 

Econometric  National  Production 

PIES 

662 

3,011 

13,376 

Energy  Production  &  Consumption 

PILOT 

974 

2,172 

12,927 

Energy  Development  Planning 

REFINE 

5,220 

5,994 

40,207 

Oil  Refinery  Model 

STEEL 

831 

1,276 

9,608 

Econometric  Production  &  Distribution 

TRUCK 

220 

4,752 

30,074 

Fleet  Dispatch  (Set  Cover) 

WADDING 

2,991 

15,001 

82,708 

Multicommodity  Prod.  &  Distribution  Planning 

Algorithms  GNRa  and  GNRd  were  used  to  identify  GNR  rows  with  Algorithm 
UBR  used  to  give  an  upper  bound  on  the  total  number  of  such  rows.  To  check 
accuracy,  we  attempted,  within  budget  limitations,  to  solve  exactly  the  integer 
linear  programs  for  A/(GNr)  in  those  cases  where  |/r|  <UB r  .  (Vie  were  success¬ 
ful  in  all  but  one  case,  as  seen.  Times  for  solving  the  ILPs  averaged  214.1 
seconds  for  those  problems  solved.)  Results  for  GNRa  and  GNRd,  given  in  Table  2, 
are  (a)  the  size  of  the  optimal  GNr  set  found  by  the  1LP,  (b)  the  size  of  this  set  as 
a  percentage  of  total  problem  rows  m,  (c)  the  size  of  the  GNr  set  found  by  GNR, 
(d)  the  size  of  this  set  as  a  percentage  of  the  1LP  optimum,  and  (e)  the  time 
required  by  the  algorithm.  For  GNRd,  the  column  labeled  |/r|  uses  the  notation 
a:b  where  a  is  |/r|  and  b  is  the  number  of  rows  in  /r  which  were  gained  in  the 
addition  phase  of  the  heuristic.  Problems  are  weighted  equally  in  computing 
average  percentages  in  the  "Totals"  row  of  the  table.  Times  listed  do  not  include 
input  or  output. 

All  GNR  variants  perform  quite  well.  The  addition  phase  in  GNRd  did  not 
often  contribute  a  significant  fraction  of  the  GN  rows  found,  but  the  additional 
rows  found  helped  make  both  GNRd  variants  slightly  better  than  either  of  the 
GNRa  variants.  The  best  algorithm  on  this  problem  set,  GNRd  (Senju  &  Toyoda), 
finds  an  average  of  99.1%  of  the  maximum  GNr  set  on  those  problems  which  we 
can  solve  exactly.  The  GNr  sets  average  62.3%  of  the  total  problem  rows  on 
these  same  problems.  GNR  computation  times  are  nominal  compared  with 
actual  solution  times  of  the  seminal  LPs  and  MIPs. 


Table  2  Results  for  Algorithms  GNRa  and  GNRd 
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Results  for  UBR,  given  in  Table  3,  include  (a)  the  size  of  the  optimal  GNr  set, 
(b)  the  upper  bound,  (c)  the  upper  bound  as  a  percentage  of  the  ILP  optimum, 
and  (d)  the  time  required  to  find  the  upper  bound.  For  comparison,  we  include 
(e)  Dobson's  upper  bound  labeled  "UBD%"  and  (f)  that  bound  as  a  percentage  of 
the  ILP  optimum.  Table  3  also  displays  some  properties  of  GNr  as  found  by 
GNRd,  Senju  &  Toyoda.  These  properties  include  (g),  the  number  of  disjoint 
embedded  GN  components,  (h)  the  largest  and  smallest  components,  (i)  the 
number  of  null  columns,  and  (j)  the  number  of  singleton  columns.  These  proper¬ 
ties  are  oT  interest  since  the  structure  of  the  embedded  generalized  network 
affects  the  solution  techniques  used  in  an  LP  factorization.  For  example,  com¬ 
ponents  consisting  of  single  rows  may  be  handled  most  efficiently  without  utiliz¬ 
ing  a  complete  generalized  network  code. 


Tabic  3  GNR  Features 


Problem 

«(GNr) 

ILP  Opt. 

Algorithm  UBR 

rJB%  7.  Opt.  Time 

Dobson  Bound 

'JBDr  7.  Opt. 

Total 

Embedded  CNR  Components 

Largest  Smallest  Null 

(mm)  (m+n)  Cols. 

Sing. 

Cols. 

AS 

170 

170 

ICO 

.0 

170 

ICO 

1 

1704  3.040 

- 

0 

57 

2,198 

2.214 

ICO. 7 

1.7 

3,798 

172.8 

145 

1.1184-3.431 

1  +  1 

0 

1.234 

COAX, 

170 

170 

100 

0 

170 

ICO 

1 

1704-3,753 

- 

0 

0 

CUBIC  1 

312 

324 

103  0 

2 

535 

190.7 

30 

150)  1.719 

1  +  2 

124 

012 

CUBIC2 

1.284 

1.332 

1C5  4 

2.7 

2470 

106.1 

140 

5624  0.094 

1  +  0 

353 

2.403 

CUPS 

333 

336 

ICO 

.0 

353 

1C6.0 

13 

GC4-1C2 

12+12 

72 

74 

ELEC 

520 

524 

100  9 

2 

705 

133  6 

14 

74  4  4C8 

2  +  16 

10 

174 

FERT 

572 

572 

ICO 

.1 

CCO 

1C!  9 

1 

57240.024 

- 

0 

1.757 

FOAM 

951 

057 

ICO  6 

0 

091 

1C4  2 

11 

311  +  1,321 

1+1 

14 

1,101 

FOOD 

3.71G 

3.720 

100  1 

.1 

3339 

ICG  0 

75 

1,73547,147 

1  +  4 

522 

e.oeo 

GAS 

73 

74 

101  1 

1 

682 

934  2 

11 

5344,711 

1  +2 

330 

5, C10 

J  CAP 

1.013 

1.C31 

ioi  a 

.2 

2162 

213  4 

130 

1 IC4  4C3 

1  +  2 

82 

1.305 

LANG 

71* 

726 

101  7 

1 

1122 

157  1 

3 

704+1.235 

142 

189 

31 1 

NETTING 

72 

72 

ICO 

0 

04 

1 1C  7 

17 

2C+E0 

2+1 

23 

99  C 

ODSAS 

1.498 

1.513 

ICC  0 

2.3 

4101 

273  1 

115 

701  +2.4C3 

14  4 

537 

1.893 

PAD 

122 

122 

100 

.0 

558 

457  5 

3 

82 1  1.351 

8+33 

1.733 

1.179 

PAPER 

1.839 

l.e63 

101.5 

.4 

2733 

143  7 

4C2 

235+1,631 

1  +  1 

C75 

1,761 

PIES 

288 

1C2.8 

.0 

571 

193  3 

35 

140  +  1.C15 

1  +  2 

S2G 

720 

F"0T 

■170 

•ICO 

10-  3 

.1 

807 

123  7 

78 

177  4  533 

14  1 

CIO 

624 

RL»'i*\E 

3,  \2a 

3. 1 79 

101  G 

8 

4V68 

152  3 

07  t 

1  +  1 

2.153 

STEEL 

431 

458 

1C6.3 

.1 

763 

177  C 

180+541 

1+1 

213 

543 

TRUCK 

NA 

1C5 

NA 

.1 

107 

NA 

2 

69  +  3. 028 

1  +  18 

1,703 

2.315 

WADDING 

2.211 

2,222 

ICC  5 

.2 

2800 

123  8 

3 

969+1. ICO 

1  +  1 

4.414 

5.C32 

UB r  is  surprisingly  tight,  averaging  101.4%  of  the  true  maximum,  and  com¬ 
putation  times  are  nominal.  Dobson's  bound  is  poor,  averaging  203.1%  of  the 
true  maximum.  The  GN  components  found  usually  consist  of  a  few  large  com¬ 
ponents  and  numerous  small  components. 


Table  4  gives  the  results  obtained  by  Algorithm  GNRC  and  Algorithm  UBRC. 
Since  no  ILP  optimum  is  known  for  Af(GNRc)  in  most  cases,  the  items  displayed 
differ  from  those  items  displayed  in  Tables  2  and  3.  The  results  reported  for 
Algorithm  GNRC  are  (a)  the  size  of  the  of  the  GNrx  structure  found,  (b)  the  time 
in  seconds  required  to  find  the  structure  excluding  input  and  output,  (c)  the  size 
of  the  GNr,c  as  a  percentage  of  the  total  constraint  matrix,  and  (d)  the  percen¬ 
tage  of  of  the  LP  upper  bound  (UBLPf^ c)  achieved  by  the  algorithm.  The  results 
reported  for  Algorithm  UBRC  are  (e)  |/r.cI  +  I«/r.cI  as  a  percentage  of  UBrc,  and 
(f)  the  time  required  to  obtain  C/B^c-  For  comparison,  the  last  two  columns  of 
the  table  give  the  total  number  of  rows  and  columns  obtained  for  the  GNr  and 
GNc  problems.  These  are  the  sizes  of  the  embedded  GN  submatrices  when  res¬ 
tricted  to  row  submatrices  and  column  submatrices,  respectively.  Each  prob¬ 
lem  is  weighted  equally  to  compute  average  percentages  in  the  "Totals"  row. 


Table  4.  GNRC  Results 

Algorithm  GNRC 

Alg.  UBRC 

GNr 

GNC 

Problem 

KrcI  +  I^r.cI 

Time 

%  (m-fn) 

%  UBLPrc 

%  UBk  c 

Time 

|/Rl+n 

|/cl+|ra 

AIR 

3,210 

.0 

1 00 

100 

100 

.0 

3,210 

3,210 

ALUMINUM 

9,027 

13.6 

B3.2 

91.7 

91.7 

2.3 

8,980 

5,508 

COAL 

3,923 

.0 

100 

100 

100 

.0 

3,923 

3,923 

CUBIC  1 

3,365 

.8 

90.2 

99.4 

94.8 

.4 

3,385 

859 

CUBIC2 

13,096 

11.1 

89.7 

09.5 

94.7 

6.4 

13,096 

2,600 

CUPS 

951 

.0 

97.2 

100 

99.7 

.0 

951 

713 

ELEC 

3,322 

.3 

92.7 

99.0 

98.3 

.2 

3,320 

1,042 

FERT 

9,596 

.3 

99.7 

100 

99.9 

.2 

9,596 

2,362 

FOAM 

4,971 

.1 

99.0 

100 

09.7 

.1 

4,971 

1,044 

FOOD 

18,137 

.8 

98.5 

99.5 

09.4 

.1 

18,125 

17,860 

GAS 

5,920 

5.4 

93.5 

94.9 

94.5 

.2 

5,614 

848 

JCAP 

5,822 

5.5 

91.9 

97.7 

99.8 

.2 

4,851 

5.718 

LANG 

2,139 

1.1 

80.4 

97.0 

90.2 

.2 

2,139 

1,905 

NETTING 

262 

.0 

03.9 

97.8 

100 

.2 

262 

256 

ODSAS 

7,556 

40.0 

78.4 

70.0 

86.1 

1.2 

6,470 

5,094 

PAD 

3,621 

3.9 

90.7 

98.8 

95.3 

.3 

3,419 

2,416 

PAPER 

7,388 

4.8 

89.9 

95.9 

98.2 

.9 

7,179 

4,905 

PIES 

3,313 

.9 

90.2 

99.5 

94.8 

.2 

3,299 

2,241 

PILOT 

2,645 

1.4 

84.1 

95.7 

91.6 

.2 

2,634 

1,587 

REFINE 

9,326 

19.3 

83.2 

93.8 

62.4 

2.3 

9,104 

7,729 

STEEL 

1,700 

.9 

60.7 

91.5 

69.7 

.2 

1,605 

1,131 

7HVC  K 

4,822 

5 

97.0 

NA 

90.3 

.3 

4.373 

220 

17,209 

8.3 

95.6 

99.7 

97.8 

1.0 

17,209 

14,451 

Totals 

141,321 

118.1 

91.3% 

96.8% 

05.6 

17.1 

138,232 

87,582 

Note:  NA  indicates  not  available. 


GNRC  performs  very  well,  also.  The  algorithm  finds  a  GNr,c  structure  whose 
size  averages  91.3%  of  the  size  of  the  total  constraint  matrix.  The  size  of  the 
structure  averages  96.8%  of  the  LP  upper  bound  on  those  problems  for  which  the 
bound  was  obtained.  (Times  to  obtain  the  LP  bound  averaged  315.6  seconds.) 
With  respect  to  UBrc,  the  GNrx  set  found  averages  95.6%.  Thus,  the  upper 
bound  provided  by  algorithm  UBRC  is  only  slightly  weaker,  on  average,  than  the 
LP  upper  bound.  In  addition,  UBRC  has  more  than  a  400  to  1  computational 


speed  advantage  over  the  LP  upper  bound  making  it  very  attractive. 

Additional  computational  studies  have  been  performed  to  investigate  the 
structures  which  GNR  and  GNRC  obtain.  Figure  1  summarizes  this  work  for 
ELEC,  JCAP,  PAD.  PIES  and  PILOT.  The  outer  rectangle  represents,  to  scale,  the 
constraint  matrix  for  each  problem.  The  area  above  the  dashed  line  represents 
the  GNr  set  found  by  GNRd,  Senju  &  Toyoda.  Within  this  area  are  indicated  the 
connected  components  found  by  a  simple  connectivity  algorithm.  As  indicated 
previously  in  Table  3,  a  few  large  components  are  typically  found  together  with 
numerous  small  components.  The  area  to  the  left  of  the  vertical  line  represents 
the  GNc  set.  The  irregular  lines  trace  the  trajectories  of  the  GNrc  structures 
found  by  GNRC,  ranging  from  GNr  on  the  right  to  GNc  at  the  lower  left.  From  any 
point  on  this  trajectory,  all  rows  and  columns  above  and  to  the  left  form  a  GN 
set.  The  circle  indicates  the  largest  GNr_c  structure  found  on  this  trajectory. 

5.  Conclusion 

Although  GNc  identification  is  easy,  GNr  and  GNr  c  identification  is  theoreti¬ 
cally  difficult.  However,  maximal,  and  often  optimal  GNr  and  GNrC  substruc¬ 
tures  can  be  found  in  an  LP  constraint  matrix  using  the  heuristic  algorithms 
developed  here.  In  some  problems,  large  GNr  structures  can  be  found,  while  in 
other  problems,  it  is  necessary  to  remove  some  columns  to  find  a  large  embed¬ 
ded  GNrc  structure.  Since  execution  time  is  modest  for  heuristic  GN 
identification,  our  algorithms  can  be  applied  as  a  matter  of  course  in  general 
LPs  to  seek  GN  substructures.  Evidence  from  the  problem  set  indicates  that 
this  is  well-advised  if  a  GN-exploiting  method  is  available:  No  members  of  the 
problem  set  were  known,  a  priori,  to  contain  significant  GN  structure  and  yet,  in 
several  cases,  GN  structure  was  predominant. 
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Figure  1.  (continued) 
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